Runtime Data Declustering based on Bandwidth-on-Demand and its Evaluation over SAN-connected PC cluster

نویسندگان

  • Masato Oguchi
  • Masaru Kitsuregawa
چکیده

Clusters of computers are used in large scale server sites recently, because of their good scalability and cost/performance ratio. In addition, Storage Area Network (SAN) is introduced in order to consolidate back end of such systems. I/O-bottleneck is serious problem in such an environment, because some important data-intensive applications often access part of data concurrently and repeatedly through SAN. In order to resolve this problem, an idea of “Bandwidth-on-Demand” is introduced in this paper. Runtime declustering middleware based on this idea is implemented on SAN-connected PC cluster, and evaluated with data mining application. According to the results of experiments, an overhead of this middleware is trivial, and I/O-bottleneck problem is resolved after declustering is completed, therefore, preferably good performance improvement is achieved. The runtime declustering middleware achieves better performance improvement especially when the number of nodes used in the cluster is large, because I/O-bottleneck problem becomes serious in such a case. Furthermore, data migration after declustering costs minimal overhead.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Analysis of Runtime Data Declustering over SAN-Connected PC Cluster

Personal computer/workstation (PC/WS) clusters have come to be studied intensively in the field of parallel and distributed computing. They are considered to play an important role as a large scale computer system in the next generation, such as large server sites and/or high performance parallel computers, because of their good scalability and cost performance ratio. In the viewpoint of applic...

متن کامل

Runtime Data Declustering over SAN-Connected PC Cluster System

Recently, personal computer/workstation (PC/WS) clusters have come to be studied intensively in the field of parallel and distributed computing. In the viewpoint of applications, data intensive applications including data mining and ad-hoc query processing in databases are considered very important for massively parallel processors, in addition to the conventional scientific calculation. Thus, ...

متن کامل

Run-Time Load Balancing System on SAN-connected PC Cluster for Dynamic Injection of CPU and Disk Resource - A Case Study of Data Mining Application

PC cluster system is an attractive platform for data-intensive applications. But the conventional shared-nothing system has a limit on load balancing performance and it is difficult to change the number of nodes and disks dynamically during execution. In this paper, we develop dynamic resource injection, where the system can inject CPU power and expand I/O bandwidth by adding nodes and disks dy...

متن کامل

Implementation and Evaluation of Parallel Data Mining on PC Cluster and Optimization of its Execution Environments

Personal Computer/Workstation clusters have been studied intensively in the field of parallel and distributed computing. In the viewpoint of applications, data intensive applications such as data mining and ad-hoc query processing in databases are considered very important for high performance computing, as well as conventional scientific calculations. We have built and evaluated PC cluster pil...

متن کامل

Data mining on PC cluster connected with storage area network: its preliminary experimental results

Personal computer/Workstation (PC/WS) clusters have become a hot research topic recently in the field of parallel and distributed computing. They are considered to play an important role as a large scale computer system, such as large server sites and/or high performance parallel computers, because of their good scalability and cost performance ratio. In the viewpoint of applications, data inte...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002